Sentiment Analysis of Code-Mixed Languages Leveraging Resource Rich Languages
نویسندگان
چکیده
Code-mixed data is an important challenge of natural language processing because its characteristics completely vary from the traditional structures standard languages. In this paper, we propose a novel approach called Sentiment Analysis Code-Mixed Text (SACMT) to classify sentences into their corresponding sentiment - positive, negative or neutral, using contrastive learning. We utilize shared parameters siamese networks map code-mixed and languages common space. Also, introduce basic clustering based preprocessing method capture variations transliterated words. Our experiments reveal that SACMT outperforms state-of-the-art approaches in analysis for text by 7.6% accuracy 10.1% F-score.
منابع مشابه
Sentiment Analysis of Code-Mixed Languages leveraging Resource Rich Languages
Code-mixed data is an important challenge of natural language processing because its characteristics completely vary from the traditional structures of standard languages. In this paper, we propose a novel approach called Sentiment Analysis of Code-Mixed Text (SACMT) to classify sentences into their corresponding sentiment positive, negative or neutral, using contrastive learning. We utilize th...
متن کاملSentiment Analysis of Code-Mixed Indian Languages: An Overview of SAIL_Code-Mixed Shared Task @ICON-2017
Sentiment analysis is essential in many real-world applications such as stance detection, review analysis, recommendation system, and so on. Sentiment analysis becomes more difficult when the data is noisy and collected from social media. India is a multilingual country; people use more than one languages to communicate within themselves. The switching in between the languages is called code-sw...
متن کاملLanguage-Specific Sentiment Analysis in Morphologically Rich Languages
In this paper, we propose languagespecific methods of sentiment analysis in morphologically rich languages. In contrast of previous works confined to statistical methods, we make use of various linguistic features effectively. In particular, we make chunk structures by using the dependence relations of morpheme sequences to restrain semantic scope of influence of opinionated terms. In conclusio...
متن کاملPreparing Bengali-English Code-Mixed Corpus for Sentiment Analysis of Indian Languages
Analysis of informative contents and sentiments of social users has been attempted quite intensively in the recent past. Most of the systems are usable only for monolingual data and fails or gives poor results when used on data with code-mixing property. To gather attention and encourage researchers to work on this crisis, we prepared gold standard Bengali-English code-mixed data with language ...
متن کاملRobust Cross-Domain Sentiment Analysis for Low-Resource Languages
While various approaches to domain adaptation exist, the majority of them requires knowledge of the target domain, and additional data, preferably labeled. For a language like English, it is often feasible to match most of those conditions, but in low-resource languages, it presents a problem. We explore the situation when neither data nor other information about the target domain is available....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2023
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-23804-8_9